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SEQUENCE LISTING 

<110> National Institute of Advanced Industrial Science and Technology- 
National Institute of Technology and Evaluation 
National Research Institute of Brewing 

<120> Pyroglutamyl peptidase and a gene thereof 

<130> A21773A 

<150> JP 2001-403261 

<151> 2001-12-27 

<160> 10 

<170> Patentln version 3. 1 

<210> 1 
<211> 2201 
<212> DNA 

<213> Aspergillus oryzae 
<220> 

<223> Inventor: Machida, Masayuki; Abe, Keietsu; Gorai, Katsuya; 
Inventor: Asai, Kiyoshi; Sano, Motoaki ; Kin, Taishin 
Inventor: Nagasaki, Hideki; Hosoyama, Akira; Akita, Osamu 
Inventor: Ogasawara, Naotake; Kuhara, Satoru; Tokunaga, Chikara 
Inventor: Toda, Itaru; Saitoh, Chiaki; Senoh, Akihiro 

<220> 

<221> source 
<222> (1)..(2201) 

<223> /organisra="Aspergillus oryzae" 
/strain="RIB 40" 
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<220> 

<221> CDS 

<222> (1001). . (1111) 
<220> 

<221> CDS 

<222> (1197). . (1901) 
<220> 

<221> Intron 

<222> (1112). . (1196) 

<223> /number=l 

<400> 1 



tcaagatctg 


aatgataaca 


gatttcctag 


tccattgtaa 


cacatcgatg 


gggttgacgt 


60 


ggtaatagaa 


tcccaagact 


gatggaaatc 


gttctaggtg 


aagtcaaagt 


ctagatgatg 


120 


aataaacaac 


cagactctag 


gaaaatgctg 


gtctagaccc 


ttgggcgaga 


aggaatgtgc 


180 


tgataacgtc 


tcgctgcctt 


tcagcggtaa 


cgctaatcta 


aaagatcaac 


aaacaatcca 


240 


ggagcaacca 


gagcaatcgg 


tgcgtgttca 


gtaagtgagc 


agtgagtgca 


caggagcact 


300 


cacgtgctaa 


ccttacaaaa 


gcagcggcac 


ccatatcaaa 


caggaagaag 


tgggccgtac 


360 


ggtagttcta 


ggatgacata 


ccgaaacccc 


ttatttgttc 


gcttaaatag 


atccctgccc 


420 


agctttactg 


atggatttct 


aatcgcaaag 


taattgggtg 


aaataccatc 


ggtattaacc 


480 


tagtgaatgg 


tgattctcaa 


ccatcgagta 


caagtcattc 


tcactattga 


actttccaaa 


540 


aagccccgtg 


aacaagcagt 


ctgcggtttg 


ccccggctga 


agcaaggggg 


aaattgtcgg 


600 


tcaggactcg 


gaacttcgga 


agcgaagcag 


aatcggcggt 


ggccaaaagg 


catgcgacgt 


660 


gacagcacct 


cacatcattc 


cgggacaata 


acataggttc 


aattgcacaa 


ttgtctcaag 


720 


aacatgggtg 


attgtcagat 


tgatacgtca 


atcaagcttt 


gtgggcggtc 


aagatgaggg 


780 


gaggtcatgt 


gccttatcac 


cttatcgata 


tcgatatcgc 


gtgatgccaa 


gacctgcatg 


840 


cggtggttgt 


aatgcggggg 


aagctccgtc 


gatatcttga 


atatatcttt 


agtccctcct 


900 


ctctatcctt 


tttgtggcgt 


acatagctac 


cgtgtatata 


cgaagtaaag 


gcgttgttcc 


960 
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caccactgat tcctagcttg ccttgaccta tccactagcc atg gga gac ttt ggc 1015 

Met Gly Asp Phe Gly 
1 5 

cca ccg gtg cca ata ccc gag acg gag gta att ggt ctt get teg tea 1063 
Pro Pro Val Pro He Pro Glu Thr Glu Val He Gly Leu Ala Ser Ser 

10 15 20 

tct ttg aca gat cca gaa gag gtc teg gta ctg gtg aca ggg ttc ggt 1111 
Ser Leu Thr Asp Pro Glu Glu Val Ser Val Leu Val Thr Gly Phe Gly 

25 30 35 

gtaagtttat ctgtcttatg cttggcttat gtttctttga ccgtcttggc ttctgattcg 1171 
gccctcaaat gctaaaatat actag cca ttc aag acc aac eta gtc aat gec 1223 

Pro Phe Lys Thr Asn Leu Val Asn Ala 
40 45 
teg tat ttg att gec tea tct ctg cca gag teg ctt gac ctt cct teg 1271 
Ser Tyr Leu He Ala Ser Ser Leu Pro Glu Ser Leu Asp Leu Pro Ser 

50 55 60 

gcg aag ccg tct gga tec ggg cct act tct cgt egg att tea att cat 1319 
Ala Lys Pro Ser Gly Ser Gly Pro Thr Ser Arg Arg He Ser He His 

65 70 75 

gtc cat cca teg ccc att ccc gtc get tac tea aca gtg egg aca act 1367 
Val His Pro Ser Pro He Pro Val Ala Tyr Ser Thr Val Arg Thr Thr 

80 85 90 

att cca acc ate eta gag gat tac gec aag tec cat gga ggt cga cga 1415 
He Pro Thr He Leu Glu Asp Tyr Ala Lys Ser His Gly Gly Arg Arg 
95 100 105 110 

cca gac att gta etc cat atg gga ata gcg get aca aga teg tac tac 1463 
Pro Asp He Val Leu His Met Gly He Ala Ala Thr Arg Ser Tyr Tyr 
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115 

teg att gag acc aag gcg 
Ser lie Glu Thr Lys Ala 
130 

aaa ggc aga ate ggt tat 
Lys Gly Arg lie Gly Tyr 
145 

cag etc ccg cca gta etc 
Gin Leu Pro Pro Val Leu 
160 

gta gta egg aaa gtt etc 
Val Val Arg Lys Val Leu 
175 180 
aac acg tgg aag teg ttt 
Asn Thr Trp Lys Ser Phe 
195 

gag gac get gga cgc tac 
Glu Asp Ala Gly Arg Tyr 
210 

gee cag gcg ttt caa caa 
Ala Gin Ala Phe Gin Gin 
225 

gtg cct gga tct tgc gee 
Val Pro Gly Ser Cys Ala 
240 

gca get gga ttg ate aaa 
Ala Ala Gly Leu lie Lys 



120 

cat cga gat tct tac cac 
His Arg Asp Ser Tyr His 
135 

gaa gat ggg gag aag gtt 
Glu Asp Gly Glu Lys Val 
150 

cag get ggt cct gcg gcg 
Gin Ala Gly Pro Ala Ala 
165 170 
cac ccc cag ccg ccc aat 
His Pro Gin Pro Pro Asn 
185 

gta tct cct gga gca gac 
Val Ser Pro Gly Ala Asp 
200 

etc tgc gag ttc ate ttt 
Leu Cys Glu Phe He Phe 
215 

ggc cag cac cga aac gtc 
Gly Gin His Arg Asn Val 
230 

gac gag gac ate gag aga 
Asp Glu Asp He Glu Arg 
245 250 
get ctt gta aga tgt tgg 
Ala Leu Val Arg Cys Trp 
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125 

ttg tec gat ate 1511 
Leu Ser Asp He 
140 

tgg agg gag cag 1559 

Trp Arg Glu Gin 

155 

gat tec aca gac 1607 
Asp Ser Thr Asp 

gac gac ttt etc 1655 
Asp Asp Phe Leu 
190 

gtc egg ata tec 1703 
Val Arg He Ser 
205 

tat aca agt ctg 1751 
Tyr Thr Ser Leu 
220 

gtt ttc ttc cat 1799 

Val Phe Phe His 

235 

ggc acg gat att 1847 
Gly Thr Asp He 

gtt age gag cag 1895 
Val Ser Glu Gin 



255 260 265 270 

gta tag agcggcatgc aggttgctgg tatcgttttg caaagcaaga gcatgggcac 1951 

Val 

tggacgatat atatacttgc atttctatgg cgcggtgcac tatctgggtt ccggatgcgc 2011 

ttttagctgc agtcactcgt gatcatttat ttatagggga cttctgtccc cggtcttttc 2071 

aggttgagtt atacatgttt cacaggtttt ggatacacta tttaccctct gactactatc 2131 

gatgaatata gacagttgtc aagcatgata ttgggttcta ccgtattcgt atatgtgtag 2191 

aatttcccgt 2201 

<210> 2 
<211> 271 
<212> PRT 

<213> Aspergillus oryzae 
<400> 2 
Met Gly Asp Phe 
1 

Gly Leu Ala Ser 
20 

Val Thr Gly Phe 
35 

Leu lie Ala Ser 
50 

Pro Ser Gly Ser 
65 

Pro Ser Pro He 
Thr He Leu Glu 
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Gly Pro Pro Val Pro He 
5 10 
Ser Ser Leu Thr Asp Pro 
25 

Gly Pro Phe Lys Thr Asn 
40 

Ser Leu Pro Glu Ser Leu 
55 

Gly Pro Thr Ser Arg Arg 
70 

Pro Val Ala Tyr Ser Thr 
85 90 
Asp Tyr Ala Lys Ser His 



Pro Glu Thr Glu Val He 
15 

Glu Glu Val Ser Val Leu 
30 

Leu Val Asn Ala Ser Tyr 
45 

Asp Leu Pro Ser Ala Lys 
60 

He Ser He His Val His 
75 80 
Val Arg Thr Thr He Pro 
95 

Gly Gly Arg Arg Pro Asp 



t • » ' 



100 105 110 

He Val Leu His Met Gly He Ala Ala Thr Arg Ser Tyr Tyr Ser He 

115 120 125 

Glu Thr Lys Ala His Arg Asp Ser Tyr His Leu Ser Asp He Lys Gly 

130 135 140 

Arg He Gly Tyr Glu Asp Gly Glu Lys Val Trp Arg Glu Gin Gin Leu 
145 150 155 160 

Pro Pro Val Leu Gin Ala Gly Pro Ala Ala Asp Ser Thr Asp Val Val 

165 170 175 

Arg Lys Val Leu His Pro Gin Pro Pro Asn Asp Asp Phe Leu Asn Thr 

180 185 190 

Trp Lys Ser Phe Val Ser Pro Gly Ala Asp Val Arg He Ser Glu Asp 

195 200 205 

Ala Gly Arg Tyr Leu Cys Glu Phe He Phe Tyr Thr Ser Leu Ala Gin 

210 215 220 

Ala Phe Gin Gin Gly Gin His Arg Asn Val Val Phe Phe His Val Pro 
225 230 235 240 

Gly Ser Cys Ala Asp Glu Asp lie Glu Arg Gly Thr Asp He Ala Ala 

245 250 255 

Gly Leu He Lys Ala Leu Val Arg Cys Trp Val Ser Glu Gin Val 
260 265 270 



<210> 3 

<211> 1000 

<212> DNA 

<213> Aspergillus oryzae 
<220> 
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<221> source 
<222> (1). . (1000) 

<223> /organism="Aspergillus oryzae 

/strain="RIB 40" 
<400> 3 



tcaagatctg 


aatgataaca 


gatttcctag 


tccattgtaa 


cacatcgatg 


gggttgacgt 


60 


ggtaatagaa 


tcccaagact 


gatggaaatc 


gttctaggtg 


aagtcaaagt 


ctagatgatg 


120 


aataaacaac 


cagactctag 


gaaaatgctg 


gtctagaccc 


ttgggcgaga 


aggaatgtgc 


180 


tgataacgtc 


tcgctgcctt 


tcagcggtaa 


cgctaatcta 


aaagatcaac 


aaacaatcca 


240 


ggagcaacca 


gagcaatcgg 


tgcgtgttca 


gtaagtgagc 


agtgagtgca 


caggagcact 


300 


cacgtgctaa 


ccttacaaaa 


gcagcggcac 


ccatatcaaa 


caggaagaag 


tgggccgtac 


360 


ggtagttcta 


ggatgacata 


ccgaaacccc 


ttatttgttc 


gcttaaatag 


atccctgccc 


420 


agctttactg 


atggatttct 


aatcgcaaag 


taattgggtg 


aaataccatc 


ggtattaacc 


480 


tagtgaatgg 


tgattctcaa 


ccatcgagta 


caagtcattc 


tcactattga 


actttccaaa 


540 


aagccccgtg 


aacaagcagt 


ctgcggtttg 


ccccggctga 


agcaaggggg 


aaattgtcgg 


600 


tcaggactcg 


gaacttcgga 


agcgaagcag 


aatcggcggt 


ggccaaaagg 


catgcgacgt 


660 


gacagcacct 


cacatcattc 


cgggacaata 


acataggttc 


aattgcacaa 


ttgtctcaag 


720 


aacatgggtg 


attgtcagat 


tgatacgtca 


atcaagcttt 


gtgggcggtc 


aagatgaggg 


780 


gaggtcatgt 


gccttatcac 


cttatcgata 


tcgatatcgc 


gtgatgccaa 


gacctgcatg 


840 


cggtggttgt 


aatgcggggg 


aagctccgtc 


gatatcttga 


atatatcttt 


agtccctcct 


900 


ctctatcctt 


tttgtggcgt 


acatagctac cgtgtatata cgaagtaaag gcgttgttcc 


960 


caccactgat 


tcctagcttg 


ccttgaccta 


tccactagcc 






1000 



<210> 4 

<211> 300 

<212> DNA 

<213> Aspergillus oryzae 
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<220> 

<221> source 
<222> (1). . (300) 

<223> /organism="Aspergillus oryzae" 

/strain="RIB 40" 
<400> 4 

agcggcatgc aggttgctgg tatcgttttg caaagcaaga gcatgggcac tggacgatat 60 
atatacttgc atttctatgg cgcggtgcac tatctgggtt ccggatgcgc ttttagctgc 120 
agtcactcgt gatcatttat ttatagggga cttctgtccc cggtcttttc aggttgagtt 180 
atacatgttt cacaggtttt ggatacacta tttaccctct gactactatc gatgaatata 240 
gacagttgtc aagcatgata ttgggttcta ccgtattcgt atatgtgtag aatttcccgt 300 

<210> 5 
<211> 816 
<212> DNA 

<213> Aspergillus oryzae 
<220> 

<221> source 
<222> (1). . (816) 

<223> /organism="Aspergillus oryzae" 
/strain="RIB 40" 

<220> 

<221> CDS 

<222> (1). . (816) 

<400> 5 

atg gga gac ttt ggc cca ccg gtg cca ata ccc gag acg gag gta att 48 
Met Gly Asp Phe Gly Pro Pro Val Pro He Pro Glu Thr Glu Val He 
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1 5 
ggt ctt get teg tea tct 
Gly Leu Ala Ser Ser Ser 
20 

gtg aca ggg ttc ggt cca 
Val Thr Gly Phe Gly Pro 
35 

ttg att gee tea tct ctg 
Leu lie Ala Ser Ser Leu 
50 

ccg tct gga tec ggg cct 
Pro Ser Gly Ser Gly Pro 
65 70 
cca teg ccc att ccc gtc 
Pro Ser Pro He Pro Val 
85 

ace ate eta gag gat tac 
Thr He Leu Glu Asp Tyr 
100 

att gta etc cat atg gga 
lie Val Leu His Met Gly 
115 

gag ace aag gcg cat cga 
Glu Thr Lys Ala His Arg 
130 

aga ate ggt tat gaa gat 
Arg He Gly Tyr Glu Asp 



10 

ttg aca gat cca gaa gag 
Leu Thr Asp Pro Glu Glu 
25 

ttc aag ace aac eta gtc 
Phe Lys Thr Asn Leu Val 
40 

cca gag teg ctt gac ctt 
Pro Glu Ser Leu Asp Leu 
55 60 
act tct cgt egg att tea 
Thr Ser Arg Arg He Ser 
75 

get tac tea aca gtg egg 
Ala Tyr Ser Thr Val Arg 
90 

gee aag tec cat gga ggt 
Ala Lys Ser His Gly Gly 
105 

ata gcg get aca aga teg 
He Ala Ala Thr Arg Ser 
120 

gat tct tac cac ttg tec 
Asp Ser Tyr His Leu Ser 
135 140 
ggg gag aag gtt tgg agg 
Gly Glu Lys Val Trp Arg 



15 

gtc teg gta ctg 96 
Val Ser Val Leu 
30 

aat gee teg tat 144 

Asn Ala Ser Tyr 

45 

cct teg gcg aag 192 
Pro Ser Ala Lys 

att cat gtc cat 240 
He His Val His 
80 

aca act att cca 288 
Thr Thr He Pro 
95 

cga cga cca gac 336 
Arg Arg Pro Asp 
110 

tac tac teg att 384 

Tyr Tyr Ser lie 

125 

gat ate aaa ggc 432 
Asp He Lys Gly 

gag cag cag etc 480 
Glu Gin Gin Leu 
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145 150 155 160 

ccg cca gta etc cag get ggt cct gcg gcg gat tec aca gac gta gta 528 

Pro Pro Val Leu Gin Ala Gly Pro Ala Ala Asp Ser Thr Asp Val Val 

165 170 175 

egg aaa gtt etc cac ccc cag ccg ccc aat gac gac ttt etc aac acg 576 
Arg Lys Val Leu His Pro Gin Pro Pro Asn Asp Asp Phe Leu Asn Thr 

180 185 190 

tgg aag teg ttt gta tct cct gga gca gac gtc egg ata tec gag gac 624 
Trp Lys Ser Phe Val Ser Pro Gly Ala Asp Val Arg lie Ser Glu Asp 

195 200 205 

get gga cgc tac etc tgc gag ttc ate ttt tat aca agt ctg gee cag 672 
Ala Gly Arg Tyr Leu Cys Glu Phe He Phe Tyr Thr Ser Leu Ala Gin 

210 215 220 

gcg ttt caa caa ggc cag cac cga aac gtc gtt ttc ttc cat gtg cct 720 
Ala Phe Gin Gin Gly Gin His Arg Asn Val Val Phe Phe His Val Pro 
225 230 235 240 

gga tct tgc gee gac gag gac ate gag aga ggc acg gat att gca get 768 
Gly Ser Cys Ala Asp Glu Asp He Glu Arg Gly Thr Asp He Ala Ala 

245 250 255 

gga ttg ate aaa get ctt gta aga tgt tgg gtt age gag cag gta tag 816 
Gly Leu He Lys Ala Leu Val Arg Cys Trp Val Ser Glu Gin Val 
260 265 270 

<210> 6 
<211> 16 
<212> DNA 
<213> Artificial 

10/14 



<220> 

<223> adaptor 
<400> 6 

cgagagcggc cgctac 16 

<210> 7 

<211> 13 

<212> DNA 

<213> Artificial 

<220> 

<223> adaptor 

<400> 7 

gtagcggccg etc 13 

<210> 8 

<211> 29 

<212> DNA 

<213> Artificial 

<220> 

<223> primer for amplification of Aspergillus oryzae pyroglutamyl pepti 

dase cDNA 
<400> 8 

gggctgeaga tgggagactt tggcccacc 29 

<210> 9 
<211> 30 
<212> DNA 
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<213> Artificial 
<220> 

<223> primer for amplification of Aspergillus oryzae pyroglutamyl pepti 

dase cDNA 
<400> 9 

ggggaattcc tatacctgct cgctaaccca 30 

<210> 10 
<211> 312 
<212> PRT 
<213> Artificial 
<220> 

<223> amino acid sequence wherein 41 amino acids containing polyhistidi 
ne-tag are added at N-terminal of amino acid sequence of SEQ ID N 
0:2 

<400> 10 

Met Arg Gly 

1 

Gly Gly Gin 

Arg Trp lie 
35 

Val Pro He 
50 

Thr Asp Pro 
65 

Lys Thr Asn 
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Ser His His His His His His Gly Met Ala Ser Met Thr 

5 10 15 

Gin Met Gly Arg Asp Leu Tyr Asp Asp Asp Asp Lys Asp 
20 25 30 

Arg Pro Arg Asp Leu Gin Met Gly Asp Phe Gly Pro Pro 

40 45 
Pro Glu Thr Glu Val He Gly Leu Ala Ser Ser Ser Leu 

55 60 
Glu Glu Val Ser Val Leu Val Thr Gly Phe Gly Pro Phe 

70 75 80 

Leu Val Asn Ala Ser Tyr Leu He Ala Ser Ser Leu Pro 



85 

Glu Ser Leu Asp Leu 
100 

Ser Arg Arg lie Ser 
115 

Tyr Ser Thr Val Arg 
130 

Lys Ser His Gly Gly 
145 

Ala Ala Thr Arg Ser 
165 

Ser Tyr His Leu Ser 
180 

Glu Lys Val Trp Arg 
195 

Pro Ala Ala Asp Ser 
210 

Pro Pro Asn Asp Asp 
225 

Gly Ala Asp Val Arg 
245 

Phe He Phe Tyr Thr 
260 

Arg Asn Val Val Phe 
275 

He Glu Arg Gly Thr 
290 



Pro Ser Ala Lys 
105 

He His Val His Pro Ser 
120 

Thr Thr He Pro Thr He 
135 

Arg Arg Pro Asp He Val 
150 155 
Tyr Tyr Ser He Glu Thr 
170 

Asp He Lys Gly Arg He 
185 

Glu Gin Gin Leu Pro Pro 
200 

Thr Asp Val Val Arg Lys 
215 

Phe Leu Asn Thr Trp Lys 
230 235 
He Ser Glu Asp Ala Gly 
250 

Ser Leu Ala Gin Ala Phe 
265 

Phe His Val Pro Gly Ser 
280 

Asp He Ala Ala Gly Leu 
295 
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95 

Ser Gly Pro Thr 
110 

Pro He Pro Val Ala 
125 

Leu Glu Asp Tyr Ala 
140 

Leu His Met Gly He 
160 

Lys Ala His Arg Asp 
175 

Gly Tyr Glu Asp Gly 
190 

Val Leu Gin Ala Gly 
205 

Val Leu His Pro Gin 
220 

Ser Phe Val Ser Pro 
240 

Arg Tyr Leu Cys Glu 
255 

Gin Gin Gly Gin His 
270 

Cys Ala Asp Glu Asp 
285 

He Lys Ala Leu Val 
300 



90 

Pro Ser Gly 



Arg Cys Trp Val Ser Glu Gin Val 
305 310 
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